Goto

Collaborating Authors

 fictitious play




60ce36723c17bbac504f2ef4c8a46995-AuthorFeedback.pdf

Neural Information Processing Systems

Also, inrecent years there has been growing synergybetween the game theory6 community andthebroader machine learning community. We'll include this discussion in the final version of the paper.[Re"...n-player27 variants..."] Interesting question! "...fictitious play..."] Yes, fictitious play could be used in this case.40 Going forward, an interesting challenge will be around employing abstraction and approximation55 techniques that will allow one to scale to larger games and construct mediators that can handle larger interactions.56




Exponential Lower Bounds for Fictitious Play in Potential Games

Neural Information Processing Systems

Fictitious Play (FP) is a simple and natural dynamic for repeated play with many applications in game theory and multi-agent reinforcement learning. It was introduced by Brown and its convergence properties for two-player zero-sum games was established later by Robinson. Potential games [Monderer and Shapley 1996] is another class of games which exhibit the FP property [Monderer and Shapley 1996], i.e., FP dynamics converges to a Nash equilibrium if all agents follows it.


Fictitious Play for Mean Field Games: Continuous Time Analysis and Applications

Neural Information Processing Systems

In this paper, we deepen the analysis of continuous time Fictitious Play learning algorithm to the consideration of various finite state Mean Field Game settings (finite horizon, $\gamma$-discounted), allowing in particular for the introduction of an additional common noise. We first present a theoretical convergence analysis of the continuous time Fictitious Play process and prove that the induced exploitability decreases at a rate $O(\frac{1}{t})$. Such analysis emphasizes the use of exploitability as a relevant metric for evaluating the convergence towards a Nash equilibrium in the context of Mean Field Games. These theoretical contributions are supported by numerical experiments provided in either model-based or model-free settings. We provide hereby for the first time converging learning dynamics for Mean Field Games in the presence of common noise.


High-dimensional Mean-Field Games by Particle-based Flow Matching

Yu, Jiajia, Lee, Junghwan, Xie, Yao, Cheng, Xiuyuan

arXiv.org Machine Learning

Mean-field games (MFGs) study the Nash equilibrium of systems with a continuum of interacting agents, which can be formulated as the fixed-point of optimal control problems. They provide a unified framework for a variety of applications, including optimal transport (OT) and generative models. Despite their broad applicability, solving high-dimensional MFGs remains a significant challenge due to fundamental computational and analytical obstacles. In this work, we propose a particle-based deep Flow Matching (FM) method to tackle high-dimensional MFG computation. In each iteration of our proximal fixed-point scheme, particles are updated using first-order information, and a flow neural network is trained to match the velocity of the sample trajectories in a simulation-free manner. Theoretically, in the optimal control setting, we prove that our scheme converges to a stationary point sublinearly, and upgrade to linear (exponential) convergence under additional convexity assumptions. Our proof uses FM to induce an Eulerian coordinate (density-based) from a Lagrangian one (particle-based), and this also leads to certain equivalence results between the two formulations for MFGs when the Eulerian solution is sufficiently regular. Our method demonstrates promising performance on non-potential MFGs and high-dimensional OT problems cast as MFGs through a relaxed terminal-cost formulation.


A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning

Neural Information Processing Systems

In this paper, we first observe that policies learned using InRL can overfit to the other agents' policies during training, failing to sufficiently generalize during execution. We introduce a new metric, joint-policy correlation, to quantify this effect.


Perturbing Best Responses in Zero-Sum Games

Dziwoki, Adam, Horcik, Rostislav

arXiv.org Artificial Intelligence

This paper investigates the impact of perturbations on the best-response-based algorithms approximating Nash equilib-ria in zero-sum games, namely Double Oracle and Fictitious Play. More precisely, we assume that the oracle computing the best responses perturbs the utilities before selecting the best response. We show that using such an oracle reduces the number of iterations for both algorithms. For some cases, suitable perturbations ensure the expected number of iterations is logarithmic. Although the utility perturbation is computationally demanding as it requires iterating through all pure strategies, we demonstrate that one can efficiently perturb the utilities in games where pure strategies have further inner structure.